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ABSTRACT 

This  document  is  one  of  a  series  of  volumes  of  the  SPAN 
RHnSRENCE  MANUAL  describing  the  SPAN  system  for  data 
management  and  statistical  analysis.  The  SPAN  system 
provides  an  integrated  file  processing  and  multivariate 
analysis  capability  for  problems  involving  large  data 
matrices  with  large  numbers  of  variables.  Ihe  system 
is  oriented  to  problems  of  social  science  In  general 
and  urban  data  analysis  in  particular. 

Ifcis  volume  describes  the  specification  and  use  of  SPAN 
capabilities  for  the  transformation  of  the  contents  of 
a  data  file  in  the  course  of  its  input  to  a  SPAN  process. 
Itie  data-transformation  capability  allows  the  user  to 
state  algorithmic  procedures  leading  to  the  generation 
of  new  data  variables  as  transformations  of  the  source 
data.  The  entity  stratification  capability  provides 
for  the  classification  and  selection  of  entity  records 
according  to  user-specified  criteria. 
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FOREWORD 

Capabilities,  specifications  formats,  and  other  details 
of  the  application  of  the  data-transforaation  and  entity- 
stratification  languages  described  in  this  volume  correspond 
to  the  following  operational  components  of  the  SPAN  system 

1.  SPAN  System  Library  Version  3  (arid  subsequent 
versions);  and 

2.  SPAN  System  Supervisor  (binary  deck),  dated 
February  12,  1965 . 

The  current  operational  status  of  the  SPAN  system  is 
documented  in  TM-I563/05C' . 
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I.  INTRODUCTION:  THE  STANDARD  INPUT  PROCESS 


A  typical  single  SPAN  task  consists  of  obtaining  information  from  a  source 
file  and  subjecting  it  to  statistical  analysis,  tabulation,  listing,  or  other 
foms  of  processing.  SPAN  provides  the  capability  to  transform  or  stratify 
data  token  from  the  source  file  prior  to  subsequent  processing.  The  methods 
for  accomplishing  this  are  described  in  this  volume. 

As  a  simple  example,  th*»  source  file  might  contain  data  on  the  total  pop’ilation 
and  its  age  distribution  by  census  tracts.  The  data-transformation  capability 
makes  it  possible  to  compute  the  new  variable  "percent  population  over  65" 
for  each  census  tract.  Ihe  stratification  capability  makes  it  possible  to 
stratify  the  tracts  into  groups  according  to  size  of  population.  However, 
the  stratification  process  takes  place  after  data  transformations  so  that 
one  could  stratify  on  the  basis  of  a  newly  created  variable.  Thus,  in  terms 
of  the  above  example,  one  could  stratify  the  tracts  into  groups  according 
to  percent  population  over  65.  Inherent  in  the  stratification  process  is 
the  capability  to  select  or  reject  tracts  on  the  basis  of  specified  criteria. 
The  wide  range  of  possibilities  in  both  data  transformations  and  stratification 
is  discussed  in  the  following  sections. 

The  SPAN  capabilities  for  data  transformations  and  entity  stratifications 
(including  record  selection)  may  be  exercised  in  the  following  SPAN  modules: 

Factor  Analysis 
Regression  Analysis 

Latent  Class  Analysis  (record  selection  only) 

Rectangular  Product  Moments 
File  Transformation 
STARS  Files  Abstractor 
STARS  Files  Collator 
STARS  Files  Tabulator 
STARS  Files  Sunsaary-Sort 
SPAN  Report  Generator 

System  for  Mixed  Data  Structures  Reduction 
SPAN  Graphic  Display  system 


THE  SOURCE  FILE 


Only  those  aspects  of  file  organization  necessary  to  an  understanding  of 
this  volume  will  be  covered  here;  a  complete  discussion  of  file  organization 
is  given  in  TM-1563/015/xx.  Furthermore,  the  discussion  will  be  limited  to 
"simple  files";  a  simple  file  is  one  in  which  each  record  has  the  same  format. 
"Mixed  data  files"  and  their  treatment  are  discussed  in  1M-1 5^3/02 3/xx. 
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The  typical  source  file  consists  of  observations  on  the  properties  of  entities. 
Entities  might  be  census  tracts,  Individuals,  time  intervals,  events,  or  any 
other  "units";  properties  might  be  population,  land  area,  age,  percent 
unemployed,  and  so  forth.  One  can  think  of  the  file  as  a  two-dimensional 
array;  in  SPAN,  by  convention,  rows  are  entities,  and  columns  are  properties. 

A  typical  source  file  is  made  up  of  records,  each  record  consisting  of  the 
values  of  the  properties  for  one  entity. 

The  uroperties  for  a  given  entity  are  of  tvo  types:  codes  and  variables. 

Codes  are  properties  that  identify  the  entity,  such  as  census  tract  code, 
man  number,  X-Y  coordinates,  event  identification,  etc.  A  typical  source-file 
record  peralts  a  total  of  fifty  different  codes.  Variables  are  those  properties 
such  as  population,  age,  etc.,  that  represent  the  computational  data  for  the 
entities.  Typically,  a  ma::ii«um  of  1000  variables  is  possible.  Within  a 
source  file  record,  the  set  of  codes  is  logically  separate  from,  and  precedes, 
the  set  of  variables. 


THE  STANDARD  INPUT  PROCESS 

A  file  is  processed  serially,  that  is,  a  single  record  is  obtained,  transformed, 
stratified,  and  subjected  to  such  further  processing  as  specified,  and  then 
another  record  is  obtained.  For  ease  of  discussion,  we  shall  define  two 
internal  regions,  the  I-region  and  the  X-region.  As  each  record  is  obtained 
from  the  source  file,  the  codes  are  placed  in  the  I-region  and  the  variables 
in  the  X-region. 

Obviously  the  user  specifies  whether  a  property  is  a  code  or  a  variable. 

However,  codes  and  variables,  as  will  be  seen,  differ  in  the  manner  in  which 
they  can  be  operated  upon,  and  sometimes  the  user  may  find  it  necessary  to 
include  the  same  property  both  as  a  code  and  as  a  variable. 

Bach  code  or  variable  corresponds  to  one  location  in  the  I-region  or  X-region 
respectively.  (The  internal  representation  of  codes  and  variables  will  be 
discussed  later.)  Ihc  I-region  can  accommodate  100  codes;  since  the  typical 
record  contains,  at  most,  50  codes,  additional  space  is  available  for  new 
codes  arising  from  data  transformations.  Codes  are  identified  by  th-ir 
location  in  the  I-region,  e.g.,  1(1),  1(2),  i(lOC).  Codes  l(99)  and 

1(100)  are  sometimes  used  for  specific  purposes  by  SPAN  nodules,  and  are 
therefore  not  always  available.  The  X-region  normally  accommodates  1400 
variables,  identified  by  their  location,  e.g.,  X(l),  X(2),  etc.  Ihe  typical 
record  contains  a  maximum  of  1000  variables;  additional  3pace  is  thus  available 
for  new  variables  arising  from  transformations.  Note  that  if  the  record 
contained  only  200  variables,  locations  X(201)  through  X(l4O0)  would  be 
available;  a  similar  comment  applies  to  the  I-region.  Ihe  I-region  and 
X-region  ore  automatically  cleared  prior  to  the  processing  e**  the  first  record. 
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Referring  to  Figure  1-1,  we  see  that  a  record  is  taken  frctn  the  source  file, 
and  the  codes  and  variables  are  placed,  respectively,  in  the  I -region  and 
X-region.  If  any  data  transformations  or  stratification  are  to  take  place, 
or  any  subsequent  computations  axe  to  be  made,  as  distinct  from  merely 
moving  data,  all  of  the  variables  are  automatically  converted  to  floating  point, 
if  they  are  not  already  in  that  form. 

Data  transformations  may  be  made  on  variables  or  codes.  Transformations  on 
variables  are  normally  floating-point  arithmetic  operations;  that  is,  new 
variables  are  created  by  arithmetic  operations  on  existing  variables.  Codes 
may  be  transformed  by  means  of  Boolean  or  integer  operations. 

Stratification  permits  the  user  to  assign  stratum  numbers  to  each  entity 
according  to  a  wide  range  of  user-specified  conditions.  Thus,  if  one  wishes 
to  stratify  a  file  of  records  into  five  groups  according  to  the  value  of  a 
given  variable,  one  might  assign  the  stratum  numbers  1,  2,  ...  5.  As  the 
stratification  specifications  are  executed,  the  stratum  number  for  each 
record  is  automatically  placed  in  1(99)  and  is  available,  if  desired,  in 
subsequent  processing. 

If  a  record  does  not  meet  the  specified  conditions,  the  stratum  number  is 
automatically  set  at  zero,  and  such  a  record  is  rejected;  i.e.,  it  does  not 
undergo  subsequent  processing.  Thus,  selection  is  inherent  in  stratification, 
and  as  a  special  case,  one  can  select  any  particular  records,  and  thus  reject 
others,  by  assigning  any  non-zero  integer  as  the  stratum  number  to  the 
records  to  be  selected. 

Each  record  from  the  source  file  undergoes  the  processes  shown  in  Figure  1-1, 
and  the  resulting  records  can  be  thoughtof  as  making  up  a  derivative  file. 

It  is  this  derivative  file  that  is  the  "input"  for  subsequent  processes. 

Most  SPAN-specific  processes,  such  as  abstracting,  summarizing,  and  collating, 
or  calculating  a  correlation  matrix,  are  specified  with  respect  to  the 
derivative  file.  Tills  transformation  of  source  file  into  derivative  file  is 
known  as  the  standard  input  process.  If  n^  transformations  or  stratification 
are  specified,  the  derivative  file  will  be  identical  to  the  source  file. 
Implications  of  the  standard  input  process  for  identifiers  of  file  content, 
ouch  as  labels  of  variables  or  names  of  the  codes,  variables,  and  entities 
sets  in  a  STARS  file,  are  discussed  under  the  heading  of  STARS  output  file 
description  in  the  SPAN  REFffiEJCE  MANUAL  volume  on  "Data  File  Manipulation 
and  Processing, "  TM-1563/021/xx. 

In  thv.  following  two  chapters  the  rules  for  specifying  data  transformations 
and  entity  stratification  are  discussed.  The  reader  should  assume  that  the 
data  are  already  in  the  I  and  X-reglon  and  that  he  knows  the  positions  of 
the  various  codes  and  variables. 
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THE  STANDARD  INPUT  PROCESS 


Figure  1-1 
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II.  DATA  TRANSFORMATIONS 


Data •transformation  specifications  are  used  to  create  new  codes  or  variables.* 
Newly  created  codes  and  variables  can  be  used  in  subsequent  processing  and 
to  create  additional  codes  and  variables.  Data-transformation  specifications 
are  executed  on  each  entity  record,  as  described  in  Chapter  I.  The  rules  for 
specifying  data  transformations  are  discussed  in  this  chapter. 

Data-transformation  specifications  are  composed  of  an  arbitrary  collection  of 
statements,  of  which  there  are  three  types:  (l)  transformation  statements, 
used  to  specify  arithmetic  computations ;  (2)  condition  statements,  used 
to  control  the  sequence  in  which  statements  are  executed;  and  (3)  special 
statements.  Statement  are  made  up  of  expressions  and  these,  in  turn,  of  terms. 
These  and  other  elements  of  the  data-transformation  specification  are  discussed 
below. 


TERMS 


A  term  consists  of  an  operand  prefixed  by  its  operator.  There  are  three 
types  of  terms:  (1)  arithmetic  terras,  (2)  Boolean  terms,  and  (3)  integer 
terms.  The  type  of  term  is  uniquely  determined  by  the  combination  of  operator 
and  operand;  the  same  symbol  is  often  used  for  operators  in  different  types 
of  terms,  even  though  it  represents  a  different  operation. 

Figure  Il-l  lists  the  various  possible  operands.  It  will  be  recalled  that 
X(15)  refers  to  the  15th  location  in  the  X-region,  which  corresponds  to 
the  15th  variables;  1(23)  refers  to  the  23rd  location  in  the  I-region, 
which  corresponds  to  the  23rd  code.  Note  that  in  transformations  text 
constants  are  always  enclosed  in  parentheses. 


*  The  SPAN  data-transformation  language  described  in  this  chapter  was 
conceived  and  implemented  by  Robert  A.  Hoodes,  who  participated  in 
various  phases  of  SPAN  development.  The  principal  virtue  of  the 
notation  is  its  easy  translatability,  leading  to  a  sisple  but  highly 
efficient  execution  method.  An  early  version  of  the  language  was  first 
used  by  Mr.  Hoodes  in  his  IBM  704  MUSP  system  for  multivariate 
analysis.  Mr.  Hoodes  is  now  with  the  IBM  Federal  Systems  Division. 
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TYPES  OF  OPERANDS 

Symbol 

Name 

Example 

V 

Floating  point  variable 

X(15) 

F 

Function  of  variable 

SORT  X(15) 

D 

Decimal  number  constant 

(-35) 

Cl 

Decimal  integer  code 

1(23) 

C2 

Octal  integer  code 

1(23) 

C3 

Alphameric  code 

1(23) 

C4 

Complement  of  octal  Integer  code 

-1(15) 

KL 

Decimal  integer  constant 

(11) 

K2 

Octal  Integer  constant 

(077077 ) 

K3 

Alphameric  constant 

(HA5PX43) 

Figure  II-l 


VARIABLES  AND  FUNCTIONS  OF  VARIABLES  (V  or  F) 


General  Form 


A  variable  is  represented  by  X(n),  where  n 
is  the  index  of  its  location  in  the  X-region. 


Variables,  such  as  X(65),  are  represented  internally  as  floating-point  numbers 


numbers.  Figure  II-2  lists  all  defined  functions. 
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FUNCTIONS 

OF  VARIABLES 

Square  root 

SORT  X(90) 

Logarithm  (natural) 

L0Q  X(90) 

Exponential 

EXP  X(90) 

Sine 

SIN  X(90) 

Cosine 

C0S  X(90) 

Arctangent 

AIAN  X(90) 

Figure  II-2 


The  function  of  a  variable  is  a  single  operand.  Where  the  function  of  a 
variable  would  not  yield  a  real  number  of  allowable  magnitude,  as  In  (-4)2, 
log  0,  or  e^°,  the  value  of  the  function  is  set  to  -0. 

DECIMAL  NUMBER  CONSTANTS  (D) 


General  Form 


A  decimal  number  constant  consists  of  a  string  of  decimal 
digits  with  a  decimal  point  at  the  beginning,  at  the  end, 
or  between  two  digits. 


The  decimal  number  constant  may  be  signed  or  unsigned.  A  decimal  number 
is  represented  internally  as  a  floating-point  number.  .The  magnitude  of  a 
decimal  number  constant  must  lie  between  10-1^  and  10^\  or  be  aero. 
Examples:  (-.15),  (3*l4l6),  (12.)  or  (0.). 


DECIMAL  INTEGERS  (Cl  or  KL) 


General  Form 


A  decimal  integer  constant  consists  of  1-5  decimal  digits 
written  without  a  decimal'  point. 


A  decimal  integer  code  is  represented  by  l(n),  where  n  is 
the  index  of  its  location  in  the  I-region. 
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A  decimal  integer  may  be  signed  or  unsigned.  Its  magnitude  must  be  less  than 
2^5.  a  decimal  integer  constant  would  be  expressed  as  (1492),  (-2),  or  (0). 

A  decimal  integer  code  is  represented  by  its  address,  for  example,  1(92)* 
Decimal  integers  are  stored  internally  as  pure  binary  numbers,  occupying 
the  left-half  word,  with  the  right  18  binary  bits  being  all  zeros.  If  a 
decimal  integer  is  negative,  the  minus  sign  occupies  the  high-order  position 
of  the  binary  word;  this  must  be  remembered  when  using  negative  integers  in 
Boolean  operations. 


OCTAL  INTEGERS (C2  or  K2) 


General  Form 


An  octal  integer  constant  consists  of  1*12  octal  digits 
preceded  by  the  letter  0. 

An  octal  integer  code  is  represented  by  l(n),  where  n 
is  the  index  of  its  location  in  the  I-region. 


The  maximum  value  of  an  octal  integer  is  2^  .  Leading  zeros  need  not  be 
indicated.  Examples:  (0356),  (0T7TTOOOOT7TT),  (03)-  An  octal  integer 
code  is  represented  by  its  address,  for  example,  l(92).  Octal  integers 
are  stored  internally  as  pure  binary  numbers,  each  octal  digit  comprising 
three  binary  bits. 


ALPHAMERIC  CONSTANTS  AND  CODES  (C3  or  K3) 


General  Form 


An  alphameric  constant  consists  of  1-6  alphabetic 
and/or  numeric  characters  preceded  by  the  letter  H. 

An  alphameric  code  is  represented  by  l(n),  where  n 
is  the  index  of  its  location  in  the  I-region. 


An  alphameric  constant  may  not  include  special  characters.  Ihe  H  to  indicate 
mode  must  be  used  even  if  word  begins  with  an  H.  Examples:  (HABC),  (H2376), 
(H23KV6),  (HHAPPY).  If  the  string  exceeds  six  characters,  only  the  six 
right-most  characters  are  used.  For  example,  (HALPHAMERIC)  is  equivalent  to 
(HAMERIC),  i.e.,  the  last  six  characters. 
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An  alphameric  code  is  represented  by  its  address,  for  example:  l(92) . 

Alphameric  words  are  stored  internally  as  binary  numbers,  with  two  octal  digits, 
and  therefore  six  binary  bits,  for  each  character.  They  are  left-justified, 
so  that  the  right  characters  consist  of  blanks.  The  IBM  7090  octal  (storage) 
representation  of  alphameric  characters  is  given  in  Appendix  A  ,  "7090 
Representation  of  Alphameric  Characters." 

COMPLEMENT  OF  OCTAL  INTEGER  CODE  (c4) 

The  l's  complement  of  an  octal  integer  code  is  indicated  by  placing  a  f-"l 
symbol  ahead  of  the  address  of  a  code,  for  example,  -1(15).  The  combination 
represents  a  single  operand. 


ARITHMETIC  TERMS 

Arithmetic  terms  may  be  simple  or  complex.  A  simple  arithmetic  term  consists 
of  one  operator  and  one  operand.  A  complex  arithmetic  term  represents  a 
single  operation  performed  on  more  than  one  operand.  Figure  II-3  lists  the 
operators  that  may  be  used  in  simple  arithmetic  terms;  Figure  II-k  lists  the 
two  types  of  complex  terms.  Only  certain  operands  may  be  used  with  a  given 
operator;  these  are  indicated  in  Figures  II-3  and  II-4  by  means  of  the 
operand  symbols  defined  in  Figure  II-l. 


1  SIMPLE  ARITHMETIC  TERM  OPERATORS  | 

Operator 

Operation 

Applicable  to  Operand  Types 

+ 

Add 

V,  F,  D 

- 

Subtract 

V,  F,  D 

* 

Multiply 

V,  F,  D 

/ 

Divide 

V,  F,  D 

T0 

Store  and  proceed 

V 

- 

Store  and  clear 

V 

Figure  II-3 

Division  by  zero  will  set  the  quotient  to  -0.  The  difference  between  the  two 
"store"  operators  will  be  discussed  in  the  section  on  egressions. 
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|  COMPLEX  ARITHMETIC  TERMS  | 

Term 

Operation 

Admissible  Operand  Types 

SUM  x.  THRU  x2 

Sum  operands  through 

x2  1 

V 

MED  x^  THRU  xg  INT 

Calculate  median  over 
variables  x^  through  x2, 
with  x.  being  the  first 

element  of  a  set  of 
the  lower  limits  of 
corresponding  category 
intervals;  if  n  is  the 
number  of  categories,  the 
(n+l)th  element  contains 
the  upper  limit  of  the  nth 
interval . 

V 

Figure  II -4 

A  detailed  example  of  a  median  computation  will  be  given  later. 


BOOLEAN  TERMS 


Boolean  teiros  consist  of  one  operator  and  one  operand.  They  are  listed  in 
Figure  II-5*  Only  certain  operands  may  be  used  with  a  given  operator;  these 
are  indicated  in  Figure  n-5  with  the  operand  symbols  defined  in  Figure  II-l. 


j  BOOLEAN  TERM  OPERATORS  | 

Symbol 

Operation 

Applicable  to  Operand  Types 

+ 

logical -or 

All  C  and  K 

* 

logical -and 

All  C  and  K  j 

/ 

exclusive -or 

All  C  and  K 

LEFT 

shift  left  n  bits 

KL 

RIGHT 

shift  right  n  bits 

Kl 

T0 

8 tore  and  proceed 

All  C 

x 

8 tore  and  clear 

All  C 

Figure  II-5 
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For  Boolean  operations,  including  shifts,  recall  that  all  operands  are 
internally  represented  as  pure  binary  quantities  of  36  binary  bits.  Shifts 
are,  in  effect,  accumulator  shifts,  not  "long"  or  "logical"  shifts;  therefore 
bits  are  lost  in  this  process.  On  a  left  shift  of  n  bits,  (n-1)  bits  are  lost, 
while  n  bits  are  lost  on  a  right  shift  of  n  bits.  Vacated  positions  are  filled 
with  zeros. 


INTEGER  TERMS 

Integer  terms  consist  of  an  operator  and  an  operand.  The  operators,  and 
applicable  operands,  are  given  in  Figure  II-6. 


|  INTEGER  TERM  OPERATORS 

Symbol 

Operation 

Applicable  to  Operand  Types 

+X 

Integer  Add 

Cl 

™X 

Integer  Subtract 

Cl 

*x 

Integer  Multiply 

a 

/X 

Integer  Divide 

a 

-X 

Store  and  Clear 

Cl 

Figure  II -6 


EXPRESSIONS 

An  expression  is  a  string  of  terms,  none  of  which  contains  the  operator  (»] 
or  £»xij.  Each  line  below  is  an  example  of  one  expression: 


Expression  Type 

X(52)  -  X(43)  /  (2.84)  Arithmetic 

1(2)  *  (0770000)  LEFT  (18)  ♦  (t^UKE)  Boolean 

(.55)  -  SQRTX(55)  -  X(12)  Arithmetic 

(0707)  /  1(87)  ♦  -1(3)  Boolean 

XI(52)  /X  1(33)  *X  1(1)  Integer 

MED  X(l4)  THRU  X(2l)  HIT  X(38)  T0  X(8o)  *  (2.)  Arithmetic 
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Blanks  are  Ignored  in  expressions,  so  that  expressions  equivalent  to  the 
above  can  be  written  in  many  ways.  In  the  above  examples,  each  of  the 
following  represents  a  single  term:  -X(43);  LEFT  (l8);  -  SQRIX(55);  -  -1(3); 
/X  1(33);  T0  X(80) ;  MED  X(l4)  THRU  X(21)  INT  X(38). 


RULES  FOR  CONSTRUCTING  EXPRESSIONS 


1.  All  terms  in  an  expression  must  be  of  the  same  type;  e.g.,  all  arithmetic 
terms,  Boolean  terms,  or  integer  terms.  Mixed  expressions  are  not  detected 
by  the  translator  and  will  yield  incorrect  results. 

2.  A  complex  arithmetic  term  may  appear  only  as  the  leading  term.  Thus  the 
following  is  a  correctly  formed  expression, 

SUMX(35)  THRU  X(4o)  *  X(2)/(100.) 

while  the  following  expression  is  in  error, 

X(2)  *  SUMX(35)  THRU  X(40)/(l00.) 

3.  For  expressions  made  up  of  either  simple  arithmetic  or  Boolean  terms  the 
operator  of  the  leading  term  is  always  an  implicit  [+ )  .  That  is,  no 
operator  except  the  f+ J  may  be  used  for  the  leading  term,  and  the  [+ ] 

is  never  written.  (See  the  previously  given  examples.) 

4.  Expressions  are  evaluated  from  left  to  right;  the  operator  symbol  in  each 
term  defines  the  relation  between  the  result  of  preceding  terms  and  the 
value  xeferred  to  by  the  operand.  In  the  expression  X(5 2)  -  X(43)  /  (2.84) 
the  quantity  X(43)  is  subtracted  from  X(52)  and  the  result  is  then  divided 
by  (2.84).  If  one  wanted  to  compute  X(43)  divided  by  (2.84)  and  the 
quotient  to  be  then  subtracted  from  X(52)  one  would  need  to  first  perform 
the  division  and  then  store  the  quotient  in  a  temporary  location,  as  will 
be  discussed  in  the  section  on  Statements.  In  some  cases,  other  methods 
can  be  used;  thus,  if  the  operator  ahead  of  X(43)  were  a  ♦  instead  of  a  -, 
one  could  write  X(43)  /  (2.84)  ♦  X(5 2).  In  general,  if  $  is  a  typical 
operator  and  x  is  a  typical  operand,  then  the  expression 

xlV2®3*3!V'. 


is  evaluated  as  if  it  read 

(((x1®2x2)®jX3)«^xJt) 

However,  parentheses  may  not  be  used  for  this  purpose  In  an  expression. 
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TRANSFORMATION  STATEMENTS 


General  Form 


A  transformation  statement  is  an  expression  terminated  by  a 
term  whose  operator  is  (■ J  or  [«X ]. 


Examples  of  transformation  statements  are: 

X(52)  -  X(43)  /  (2.84)  -  X(3) 

1(2)  *  ( 0770000 )  LEFT  (l8)  ♦  (H0UKE)  -  l(2) 

X  1(2)  +X  1(2)  -X  1(4) 

The  [*)  and  [«X]  operator  stores  the  result  of  the  expression  in  the  location 
represented  by  the  operand,  that  is,  in  a  location  in  the  I-reglon  or  X-region. 
A  term  whose  operator  is  f«]  or  f»XJ  is  not  part  of  the  expression.  A  term 
whose  operator  is  or  fT0X  ]  is  part  of  an  egression.  It  stores  the 
results  of  the  previous  terms  into  the  location  represented  by  its  operand. 

The  results  are  also  immediately  available  for  the  next  term-  Thus,  in  the 
statement 


X(l)  *  X(2)  T0  X(3)  ♦  X(4)  .  X(5) 

the  sum  of  X(l)  ♦  X(2)  is  stored  in  X(3),  is  then  added  to  X(4),  and  the 
final  result  stored  in  X( 5 ) *  However,  if  the  total  stored  in  X(5)  is  to  be 
used  again,  it  is  obtained  frcm  storage  by  explicitly  indicating  X(5)  in  a 
subsequent  statement.  All  of  the  terns  in  a  statement  must  be  of  the  same 
type. 

Effect  of  Transformation  Statements  on  Variable  Labels 

A  source-file  variable  may  be  the  operand  of  a  or  (*)  operator •  It 
should  be  noted,  however,  that  whenever  the  value  of  a  source  variable  la 
replaced  in  this  manner  by  a  result  of  data  transformations,  the  label 
associated  with  the  source  variable  is  set  to  zero,  and  the  source  variable 
Is  no  longer  available  for  further  processing.  Procedures  for  associating 
lsbels  with  newly  computed  variables  are  discussed  in  TM-1563/G2l/xx. 


I 
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STATEMENT  LABELS 


General  Font 


A  a tat anent  label  Is  a  string  of,  at  noet,  six 
alphanumeric  characters  bracketed  by  4  symbols. 


Any  statement  say  be  assigned  a  label.  For  example: 


4  A51  4;  $  KEXT  4;  4  Ulill  $. 

The  label  Is  placed  ahead  of  a  statement  thus 

4  NC*  4  X(l)  -  X(2)  .  X( 3) • 

Blanks  In  the  label,  as  elsewhere  In  the  transformations  teat,  are  Ignored. 
A  label  may  stand  alone  at  the  end  of  the  transformations  text  to  penult  a 
terminal  transfer  to  the  end  of  specifications. 


CONDITION  STATEMENTS 


General  Font 


Condition  statements  are  of  the  fora 

a1  •  a2  00  70  e 

where  a^  and  eg  are  expressions, 

•  is  a  condition  operator,  and 
a  is  a  statement  label. 

Condition  atataments  make  It  possible  to  control  the  sequence  in  which 
statements  are  executed. 
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If  the  condition  asserted  by  the  condition  statement  is  true,  transfer  is 
made  to  the  statement  identified  by  the  label.  Otherwise,  the  statement 
following  the  G0  T0  command  is  executed.  Integer  terms  are  not  permitted 
in  the  condition  statement.  In  a  condition  statement  a  complex  arithmetic 
term  (for  definition,  see  Figure  II-4)  may  appear  only  as  the  leading  term 
of  the  left-hand  expression.  Both  expressions  in  a  condition  statement 
must  contain  the  same  type  of  terms.  A  list  of  condition  operators  is 
given  in  Figure  II-7. 


|  CONDITION  OPERATORS 

Symbol 

Condition  Asserted 

EQ 

Ekjual 

Greater  lhan 

LS 

Less  lhan 

GQ 

Greater  or  Equal 

14 

Less  or  Equal 

Figure  II-7 

Examples  of  condition  statements  are: 

X(3)  +  (3-0)  ®  X(25)  -  L0 G  X(13)  G0  T0  $  NEXT  $ 

X(5)  BQ  (3.)  00  T0  $  5  $ 

1(3)  BQ  (HPHILAD)  00  10  $  AGAIN  $ 

1(5)  *  (0777700007777)  BQ  (HABOOCD)  00  10  $  SMILE  $ 

If  the  values  of  two  expressions  made  up  of  arithmetic  terms  are  compared, 
sero  Is  greater  than  minus  zero.  If  the  values  of  two  expression*  made  up 
of  Boolean  terms  are  compared,  zero  is  less  than  minus  zero,  since  the  minus 
sign  is  the  high-order  bit. 


SPECIAL  STATEMENTS 


There  are  two  special  statements:  the  unconditional  transfer  and  the 
weighting  function. 
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UncomlUlon  Transfer  Statement 


General  Form 
G#  T0  a 

where  s  Is  a  statement  label. 


For  example,  the  statement 


Ctf  T0  $  HERD  $ 

causes  the  statement  labeled  HERE  to  be  executed  next. 
Weighting  Function 


General  Form 


WGHT  x1  THRU  Xg  BY  x3 

where  x1  and  x2  are  respectively  the  first  and 

the  last  of  a  range  of  variables  to  be 

weighted,  and 

is  a  multiplier  variable. 


flie  weighting  function  provides  for  the  multiplication  of  each  0f  a  set  of  X-retf 
variables  by  another  variable.  For  example,  the  otat«aent 

WGHT  X(5)  THRU  X(l2)  BY  X{40) 

results  in  replacement  of  each  variable  from  X(5)  through  X(18)  by  its  value 
multiplied  by  the  value  contained  in  X{4o). 
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Effect  of  the  Weighting  Function  on  Variable  Labels 

Although  the  weighting  function  may  he  used  to  modify  the  values  of  source 
variables,  labels  associated  with  the  source  variables  are  not  affected  by 
this  process.  If  the  weighting  function  is  used  to  define  new  variables,  the 
operands  of  the  weighting  function  must  appear  elsewhere  in  the  transformations 
specifications  also  as  operands  of  the  {*$3  or  [=1  operators. 


HOW  X  SPECIFY  DATA  TRANSFORMATIONS  IN  A  SPAN  JOB 


Data- transformation  specifications  are  composed  of  an  arbitrary  collection  of 
transformation  statements,  condition  statements,  and  special  statements  that 
are  stated  ir  the  particular  order  in  which  they  are  to  be  evaluated.  Data- 
trans format ion  specifications  that  are  to  operate  on  the  data  input  to  a 
particular  SPAN  job  must  be  included  as  a  separate  sentence  among  the  control 
specifications  for  that  job.  In  a  control  sentence,  the  specifications  text 
takes  the  form  of  a  string  predicate  of  an  appropriate  control  word;  that  is, 
the  text  is  delimited  by  apostrophies  and  preceded  by  a  transformations - 
declaring  control  word. 


General  Form 


TRANSForm  Nj 

F- TRANs form  >  s 

0)  WANs  form  J 

where  s  is  the  data-transformation  specifications 
text  (S6000),  and 

©stands  for  source-file  number  in  the  case 
of  multifile  input  (0=  1,2,3  in  the  STARS 
Files  Collator). 


A  simple  example  of  data-transformation  specifications,  showing  the  control 
word  and  the  specifications  text  delimited  by  apostrophies,  would  be 

TRANSFORMATIONS  '  X(2)  ♦  X(3)  *  X(5l)  SQRTF  X(2)  =  X(2)  ’ . 

In  most  SPAN  modules,  data  transformations  are  declared  by  the  control  word 
IRANSForm.  In  the  STARS  Files  Tabulator  and  STARS  File  Transformation  modules, 
where  transformations  can  be  expressed  on  results  of  certain  subsequent 
processes  in  addition  to  those  expressed  on  the  input  data,  control  word 
TRANs  form  declares  input  data  transformations  and  the  word  F-TRANsform  is 
reserved  for  transformations  on  data  resulting  from  subsequent  processes.  In 
the  STARS  Files  Collator  module,  where  separate  transformation  specifications 
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can  be  declared  on  data  from  any  one  of  three  possible  source  files,  control 
words  i) TRANs form,  2) TRANs form,  3)TRANSferra  introduce  the  first, 
second,  and  third  file-data  transformations  respectively.  Definitive 
Information  on  control  words  usage  in  a  particular  program  is  given  in  the 
SPAN  Control  Words  Glossary,  TM-1563/013/xx. 

Within  the  actual  specifications  text,  blanks  and  commas  are  Ignored.  Terms 
and  statements  need  not  be  separated  in  any  special  way;  or,  if  desired, 
statements  may  be  separated  by  commas .  Different  transformations  may  appear 
on  the  same  card,  or  on  separate  cards  indented  for  readability.  For  example, 
the  following  two  sets  of  specifications  are  equivalent: 

TRANSFORM  'X( 5)+(3 . )BS5X(29)G0rI^l$L0GX(l3)=X(l3)G0T^ 
$2$$l$I^GX(lU)=X(l3)$2$X(l)-X(2)*X(l00l») ' . 

TRANSFORM  •  X(5)  +  (3.)  BQ  X(29)  G0  T0  $1$ 

I^G  X(13)  =  X(13),  G0  T0  $2$ 

$1$  I^G  X(l4)  =  X(13), 

$2$  X(l)  -  X(2)  =  X(lOOk) 

According  to  the  rules  for  constructing  control  specifications  in  SPAN  (see 
TM-1563/012/xx),  it  is  possible  to  intersperse  comments  and  other  optional 
text  among  the  elements  of  a  control  sentence.  It  xaay  be  also  desirable  to 
associate  explanatory  notes  with  Individual  statements  in  the  data- 
transformation  text.  The  following  example  illustrates  this  flexibility: 

CALCULATE  THE  F$LL0WING  *  TRANSFORMS  OF  INPUT  DATA  — 

*X(3)  /  X(25)  =  X(U2T) 1  GR0SS  P0PULATI0N  DENSITY 

'X(20)-X(25)=X(600),  X(3)/X(600)«  X(ll28) '  NET  RES. DENSITY. 

If  one  prefers,  for  clarity,  to  write  transformation  statements  on  separate 
cards,  enclosing  the  statements  by  apostrophies  may  serve  yet  another  purpose. 
The  text  of  data- trans formation  specifications  input  to  a  single  SPAN  job  may 
not  exceed  6000  characters,  including  blanks  and  commas.  By  enclosing 
individual  statements  in  apostrophies  and,  thus,  purging  the  specifications 
text  of  unnecessary  characters,  a  maximum  of  meaningful  text  can  be  accommodated 
in  a  control  sentence  without  sacrificing  readability.  Care  must  be  taken, 
however,  that  the  control  sentence  contain  an  even  number  of  the  delimiting 
apostrophies . 

DATA  TRANSFORMATIONS:  LIMITATIONS 


Data-trans formation  specifications  input  .to  the  STARS  Files  Collator  module 
may  not  contain  any  arithmetic  terms;  that  is,  transformations  may  be  performed 
on  I-region  data  only.  No  similar  restriction  applies  to  specifications  input 
to  other  SPAN  modules. 
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The  size  of  internal  tables  used  in  the  translation  of  the  data- transformation 
specifications  sets  limits,  in  a  particular  SPAN  job,  on  (l)  the  length  of  the 
specifications  text,  (2)  the  number  of  transformation  terms  permitted,  and 
(3)  the  number  of  constants  permitted.  These  limits  sure  shown  in  Figure  II-8. 
Appropriate  diagnostics  are  printed  if  these  limits  are  exceeded.  In  addition, 
space  allocated  to  receive  the  source  data  and  data  transforms,  the  I-region 
and  X-region,  is  of  defined  size.  References  beyond  the  limits  of  the 
I-region  or  X-region  as  defined  in  Figure  II-8,  i.e.,  references  such  as 
X(6073)  or  1(375) t  are  not  detected. 

In  all  SPAR  modules,  the  data-trans formation  text  associated  with  a  control 
word  may  not  exceed  6000  characters  in  length,  (in  the  STARS  Files  Collator, 
where  an  independent  set  of  data  transformations  can  be  specified  on  each  of 
three  possible  source  files,  this  limit  applies  to  each  set  of  specifications 
separately.) 


DATA  TRANSFORMATIONS :  LIMITATIONS 


Module 

I-Region 

Size 

X-Region 

Size 

Maximum  Trans¬ 
formation  Terms 

Maximum 
Cons  tarn  ts 

(no.  of 
elements ) 

(no.  of 
elements ) 

(no.  of  terms) 

(no.  of 
constants) 

Factor  Analysis 

100 

bOO 

500 

100 

Regression  Analysis 

100 

bOO 

500 

100 

Rectangular  Product 
Moments 

100 

ibOO 

1000 

200 

File  Trams  formation 

100 

1600 

IOOO* 

200* 

STARS  Files  Abstractor 

100 

1400 

1000 

200 

STARS  Files  Collator 

IOO** 

1000** 

1*50*## 

90*** 

STARS  Files  Tabulator 

100 

1400 

1000* 

200* 

STARS  Files  Summary-Sort 

100 

li+oo 

1000 

200 

p Combined  total  for  input  and  subsequent  process  data  transformations. 
l**For  each  of  three  possible  source  files. 
r^Combined  total  for  all  source  files. 


Figure  II-8 
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The  maximum  of  transformation  terms  permitted  refers  to  simple  terms.  For  the 
purpose  of  calculating  this'  maximum,  the  SUM  and  MED  complex  terms  are 
equivalent  to  two  and  three  simple  terms  respectively.  The  00  TO  term,  whether 
appearing  in  a  condition  statement  or  an  unconditional  transfer  statement, 
counts  as  a  simple  term.  The  comparison  operator  combined  with  the  first  term 
of  the  right-hand  expression  in  a  condition  statement  counts  as  a  simple  term. 

The  maximum  of  constants  permitted  refers  to  each  actual  appearance  of  a 
constant  in  an  expression.  Statement  labels  do  not  enter  into  the  count  of 
terms  or  constants.  There  is  no  practical  limit  on  the  number  of  labels. 


TRANSFORMATIONS  TEXT  ERROR  DETECTION  AND  DIAGNOSTICS 


Before  a  file  is  actually  processed,  the  data-transformation  specifications 
text  is  translated  into  an  internal  representation  designed  for  more  efficient 
execution.  The  translator  detects  certain  errors  in  the  specifications  text 
and  prints  appropriate  diagnostic  messages.  Errors  detected  are  those  relating 
to  (l)  excessive  size  of  the  trans formations  text,  (2)  use  of  illegal  operators 
and  operands,  (3)  violation  of  restrictions  on  the  number  of  elements  of  a 
specifications  text,  and  (4)  incorrect  labeling  of  statements.  Other  violations 
of  data-transformation  specification  rules,  such  as  use  of  mixed  expressions, 
are  not  detected.  A  detected  error  in  data-transformation* specifications 
renders  a  SPAN  job  non-executable.  Control,  in  that  case,  passes  to  the  next 
job  in  line. 

Diagnostic  messages  which  may  result  during  processing  of  data-transformation 
specifications  are  listed  below,  with  comments  and  examples  where  appropriate. 

Excessive  Size  of  Transformations  Text 


*♦»  DATA-TRANSFORMATION  SPECIFICATIONS  TEXT  IS  TOO  LONG 
‘OVER  6000  CHARACTERS). 

This  diagnostic  message  results  from  violation  of  the  requirement  that  the 
transformations  text  associated  with  a  control  word  not  exceed  the  maximum 
length  of  6000  characters.  In  calculating  the  length  of  the  transformations 
text,  note  that  each  string- terminating  apostrophe  used  in  the  control 
sentence  may  cause  1-5  blank  characters  to  be  appended  to  the  string  to 
maintain  the  internal  string  length  as  on  integral  multiple  of  six  characters. 
Blanks  so  generated  would  contribute  to  the  character  count  of  the  string. 
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Illegal  Use  of  Operators  and  Operands 

OPERATOR  IS  NOT  IN  OPERATIONS  TABLE, 

An  illegal  operator  appears  in  the  specifications  text,  as  shown  in  the 
following  example: 

***  TRANSFORMATIONS  PROGRAM  DIAGNOSTIC 
OPERATOR  IS  NOT  IN  OPERATIONS  TABLE. 

ERROR  SENSED  WHILE  SCANNING  THE  UNDERLINED  WORD  IN  THE 
FOLLOWING  PORTION  OF  THE  SOURCE  TEXT  — 

XI(1)  *  XI(15)  T0  XI(2U)  + 

(T0X)  is  no*  a  legal  integer  term  operator  (see  Figure  II-6). 

FIRST  CHARACTER  OF  A  CONSTANT  IS  ILLEGAL. 

The  first  character  of  a  constant  is  a  non-numeric  character  other  than  H, 
0,  or  -  (minus)  sign;  for  example, 

(GABC),  ((HHAFFY). 

ILLEGAL  CHARACTER  IN  AN  OCTAL  CONSTANT. 

Octal  constant  contains  a  character  other  than  digits  0  througi  7;  for 
example, 

(0  380). 

ILLEGAL  CHARACTER  IN  A  DECIMAL  CONSTANT. 

Decimal  number  or  integer  constant  contains  a  character  other  than  the 
-  (minus)  sign  or  digits  0  through  9;  for  example, 

(136A). 


Violation  of  Certain  Restrictions 


TOO  MANY  OPERATIONS  SPECIFIED  IN  SOURCE  TEXT. 

The  maximum  number  of  terms  permitted  in  the  specifications  text  has  been 
exceeded.  See  Figure  II-8  concerning  limits  in  various  modules. 

TOO  MANY  CONSTANTS  SPECIFIED  IN  SOURCE  TEXT. 


The  maximum  number  of  constants  permitted  in  the  specifications  text  has 
been  exceeded.  See  Figure  II-8  concerning  limits  in  various  modules. 
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Labeling  Errors 

DUPLICATE  TRANSFORMATIONS  LABELS. 

More  than  one  statement  has  been  assigned  the  same  label. 

MISSING  TRANSFORMATIONS  LABELS  AS  FOLLOWS—  [statement  label-J 

[statement  label^ ] 


[statement  labe^  ] 


EXECUTION  ERROR  DETECTION  AND  DIAGNOSTICS 


Certain  error  conditions  may  arise  during  execution  of  the  data-trans formation 
specifications.  Where,  in  an  expression,  the  function  of  a  variable  would  not 
yield  a  read,  number  of  allowable  magnitude,  or  where  there  is  division  by  zero, 
the  value  of  the  entire  expression  is  automatically  set  to  -0.  These 
conditions  do  not  result  in  diagnostic  comment.  Certain  floating-point 
operations  (in  arithmetic  terms)  when  operating  on  data  that  are  not  floating¬ 
point  type  may  result  in  internal  register  underflow  or  overflow.  A  diagnostic 
message  is  printed  if  such  a  condition  should  occur.  The  results  in  that  case 
are  suspect  and  the  source  file  should  be  examined  for  bad  data. 


EXAMPLES  OF  DATA  TRANSFORMATIONS 

In  the  following,  three  examples  of  data  tranf formations  are  given.  The  first 
example  represents  a  complete  transformation  program.  The  second  example 
shows  how  to  specify  the  computation  of  a  median.  The  third  is  an  example  of 
the  use  of  the  weighting  function. 

Example  I 


PERFORM  THE  F0LL0WING  *TRANS FORMATIONS , 

'X(l)  ♦  X ( 2 )  /  X(ll)  -X(1001) 

X(l)  *  X(5)  *  X(1002) 

X(U)  ♦  X(7)  LQ  (5.)  GO  TO  $1$ 

X(4)  4  X( 7)  14  (20.)  GO  TO  $2$ 

(0.)  -  X(1003) 

GO  TO  $ST0r$ 

$1$  X ( 38 ) /  x(39)  ♦  X(4)  -  (1003) 
gO  TO  $stO?$ 

$2$  X(4o)/  x(39)  +  x(4)  =  X(1003) 

$stOp$ 

Note  that  the  lust  labeled  statement  may  be  "empty"  (label  $S’lOP$)>  permitting 
a  transfer  to  the  end  of  the  transformations  specifications. 
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Example  II 

Assume  that  the  "median  of  school  years  completed"  is  to  be  calculated  for  a 
population  distribution  by  number  of  school  years  completed.  The  following 
is  the  layout  of  information  in  the  X-region: 


variable  location 


no.  of  persons,  0-6  school  yrs.  completed  X(10) 
no.  of  persons,  6-9  school  yrs.  Completed  X(ll) 
no.  of  persons,  9-12  school  yrs.  completed  X(l2) 
no.  of  persons,  12-14  school  yrs.  completed  X(13) 
no.  of  persons,  14-16  school  yrs.  completed  X(l4) 


To  compute  the  median  of  the  values  in  X(10)  through  X(l4)  it  is  necessary  to 
have  available  in  the  X-region,  in  consecutive  locations,  each  of  the  lower 
limits  of  the  classification  intervals,  e.g.,  0,  6,  9,  12  and  14  and  also  the 
upper  limit  of  the  last  interval,  e.g.,  16,  which  could  be  a  "dummy"  value. 
Notice  that  in  the  transformation  statements  below  the  assignment  of  these 
numbers  to  specific,  although  arbitrary,  X-region  locations  is  shown. 

TRANSFORM  »X(25l)  EQ  (6. )  G0  T0  $MED$ 

(0.)  -  X(250)  (6.0)  -  X(251)  (9.)  -X(252) 

(12.)  -X(253)  (14.)  «X(254)  (16. )  -X(255) 

$MED$  MED  X(10)  THRU  X(l4)  HJT  X(250)  -X(lOOl)’. 


Note  that  MED  is  an  operator,  but  $MED$  is  a  label.  As  the  first  entity  is 
processed,  X-region  locations  X(250)  to  X(255)  contain  zeros.  The  statements 
above  set  the  locations  to  the  desired  quantities.  After  the  first  entity 
there  is  a  6  in  X(251)  so  that,  as  a  result  of  the  first  statement,  the 
statements  setting  up  the  interval  locations  need  not  again  be  executed; 
instead,  the  program  immediately  proceeds  to  $MED$. 

Example  III 

The  weighting  function  has  been  previously  defined,  and  has  several  uses.  One 
possible  use  is  illustrated  below,  in  which,  to  save  writing  transformation 
specifications,  instead  of: 

X(69)  *  X(208)  .  X(100l) 

X(?0)  *  X(208)  -  X(1002)  etc. 

one  could  write: 

X(69)  -  X(1001) 

X(70)  -  X(1002)  etc. 

WGHT  X  (1001)  THRU  X(l — )  BY  X(208) 

If  only  one  variable  is  to  be  weighted  and  label  is  to  be  preserved,  then  one 
might  use  WGHT  X(n)  THRU  X(n)  BY  X(m)  where  n  and  m  are  variable  Indexes. 


4  March  1965 


25 


OT-1563/OltyOl 


III.  ENTITY  STRATIFICATION 

The  stratification  facility  in  SPAN  programs  allows  the  user  to  select  or 
stratify  records  from  an  input  file  by  stating  explicit  conditions  on  the 
properties  associated  with  an  entity.  These  condition  statements  are  evaluated 
during  the  input  processing  of  each  entity  record  (see  Figure  1-1).  If  a 
stated  condition  is  true,  the  entity  is  assigned  the  specified  stratum  number. 

If  the  stated  condition  is  false,  the  next  stated  condition  is  evaluated.  If 
none  of  the  stated  conditions  are  true,  the  stratum  number  is  automatically 
set  at  zero;  the  entity  record  is  thus  rejected  from  further  processing,  and 
the  next  entity  record  is  obtained,  as  shown  in  Figure  1-1. 

Stratification  specifications  are  executed  after  transformations  have  been 
performed  on  the  input  data.  Thus  conditions  may  be  stated  on  both  input 
variables  and  their  transforms.  The  stratum  number  assigned  to  the  entity  is 
placed  into  1(99)  and  is  therefore  available  to  subsequent  processes. 

Entity  stratification  specifications  could  be  expressed  as  a  series  of  condition 
and  transformation  statements.  The  following  transformation  specifications,  for 
example,  will  assign  the  stratum  number  6  to  entities  that  contain  a  3,  or  a  5 
through  10,  in  1(3): 


1(3)  EQ  (3)  G0  10  $CLASSF$ 

1(3)  GQ  (5)  G0  10  $NEXT$ 

G0  10  $DELETE$ 

$NEXT$  1(3)  LQ  (10)  G0  10  $CIASSF$ 

$DELETE$  (0)  =  1(99)  00  T0  $END$ 

$CLASSF$  (8)  »  1(99) 

$END$ 

The  same  stratification  can  be  accomplished  with  the  following  stratification 
specifications: 


ASSIGN  8  IF  1(3)  =  3;  5-10. 

Thus,  one  can  express  more  concisely  a  complex  set  of  conditions  with  respect 
to  sets  of  explicit  values.  However,  transformation  specifications  permit  the 
use  of  variables  and  expressions  as  condition  criteria. 
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STRATIFICATION  STATBffiNTS 


Entity  stratification  specifications  are  composed,  of  an  arbitrary  collection  of 
statements.  These  statements,  called  stratification  statements,  are  made  up  of 
condition  clauses,  and  these,  in  turn,  of  criterion  values.  These  and  other 
elements  of  the  entity  stratification  specifications  cure  discussed  below. 


General  Form 


ASSIGN  n  I?  ^  ,AND  cg  . . .  ,AND  c^ 

where  n  is  a  stratum  number  defined  below,  and 
c^,Cg, ...,0^  are  condition  clauses  defined 

below. 


A  stratification  statement  assigns  a  stratum  number  to  an  entity  if  the  condition 
expressed  by  the  statement  is  true.  The  statement  may  contain  more  than  one 
condition  clause,  in  which  case  the  clauses  are  connected  in  logical -and  fashion 
by  the  t,ANDl  operator.  A  stratification  statement  always  terminates  on  a 
period  ( . ]  . 

The  stratum  number  must,  be  an  unsigned  integer  not  exceeding,  in  most  SPAN 
programs,  2^5  in  magnitude.  In  the  STARS  Files  Tabulator,  however,  the  maximum 
stratum  number  is  80. 

A  condition  clause  asserts  the  equality  of  the  value  of  a  variable  or  code  to  at 
least  one  of  a  set  of  criterion  values. 


General  Form 


p  **  v 

where  p  is  a  code  or  variable,  and 

v  is  a  set  of  criterion  values  of  a  form 
described  below. 


For  example. 


1(3)  -  3,  5-10,  IT,  28-30 


asserts  that  l( 3)  equals  3,  5  through  10,  17,  or  28  through  30. 


h  March  1965 


21 


TM-1563/014/01 


Elements  of  the  condition  clause  are  criterion  values  or  ranges  of  criterion 
values,  as  shown.  The  elements  are  connected  by  commas  [,]  in  logical-or 
fashion,  lhe  limits  of  the  range  are  connected  by  the  [-1  symbol  and  may  be 
stated  in  any  order,  e.g.,  28-30  could  be  expressed  as  30-28. 

Hie  criterion  values  may  be  decimal  constants  (D),  decimal  integer  constants 
(Kl),  octal  integer  constants  (K 2),  or  alphameric  constants  (K3 ) ,  where  the 
symbols  in  parentheses  are  those  used  in  Figure  II-l,  Chapter  II,  (and  defined 
in  that  same  chapter).  Note  that  the  following  element  is  possible:  -.1 — .3, 
for  -.1  through  -.3*  Hie  statement  of  a  range  between  two  criterion  values 
is  interpreted  as  the  range  between  their  binary  code  values.  In  particular, 
HABC-HABD  is  equivalent  to  02122236o6o6o-02122246o6o6o.  In  stratification 
text,  constants  are  not  enclosed  in  parentheses. 

Hie  criterion  values  for  a  code,  e.g.,  1(72),  must  be  of  the  type  Kl,  K2,  or 
K3;  the  criterion  values  for  a  variable,  e.g.,  X(5l)>  must  be  of  the  type  D. 

Hie  following  are  examples  of  stratification  statements: 


ASSIGN  2  IF  1(75)  =  3,  5,  8-10,HJUNK,  AND  X(l0)  =  -.2-55-3. 


ASSIGN  185  IF  X(10)  =  -35.-36.,  39-. 

In  the  second  example,  all  periods  but  the  last  are  interpreted  as  decimal 
points.  Hie  final  period  terminates  the  statement.  Minus  [- ]  symbols  play  a 
similar  dual  role:  the  first  and  third  (- ]  signs  denote  negative  numbers, 
while  the  second  f- ]  sign  defines  the  range  between  -35-  and  -36.. 


HOW  HO  SPECIFY  ENTITT  STRATIFICATION 

Entity  stratification  specifications  that  are  to  operate  on  the  data  input 
(including  data  transformations,  if  any)  to  a  particular  SPAN  Job  must  l>e 
included  as  a  separate  sentence  among  the  control  specifications  for  that  Job. 
In  a  control  sentence,  the  specifications  text  takes  the  form  of  a  string 
predicate  of  an  appropriate  control  word;  that  is,  the  text  is  delimited  by 
apostrophies  and  preceded  by  a  stratification-  or  selection-declaring  control 
word. 
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General  Form 


SELECT 
STRATI fy 
®}SELEct 


J 


s 


where  s  is  the  entity-stratification  specifications 
text  (S6000),  and 

®  stands  for  source  file  number  in  the  case 
of  multifile  input  (©  =  1,2,3  in  the 
STARS  Files  Collator) . 


A  simple  example  of  entity-stratification  specifications,  shoving  the  control 
word  and  the  specifications  text  delimited  by  apostrophies,  would  be 

SELECT  THE  F0LL0WING  0BSERVATI0NS  ’  ASSIGN  1  IF  X(20)=  .0  .  ' . 

In  most  SPAN  modules  stratification  is  used  to  select  records  from  the  source 
file,  and  for  these  modules  the  control  word  is  SELECT.  In  the  STARS  File 
Tabulator,  the  purpose  is  stratification,  and  the  control  word  is  STRATIFY. 

In  the  STARS  File  Collator  where  the  selection  can  be  specified  on  data  from 
any  one  of  three  posrible  source  files,  control  words  }.}$ELEct,  2)SELEct, 
and  '  3)SELEct  introduce  the  first,  second,  and  third  file  entity-stratification 
text  respectively.  Information  on  control  words  usage  in  a  particular  module 
is  given  in  the  SPAN  Control  Words  Glossary,  TM-1563/013/xx • 


ENTITY  STRATIFICATION:  LIMITATIONS 


The  size  of  internal  tables  used  in  the  translation  jf  the  entity-stratification 
specifications  sets  limits,  in  a  particular  SPAN  job,  on  (l)  the  length  of  the 
specifications  text,  (2)  the  maximum  size  of  the  stratum  number,  (3)  the  number 
of  condition  clauses  permitted,  and  (4)  the  number  of  criterion  values 
permitted.  Appropriate  diagnostics  are  printed  if  these  limits  are  exceeded. 

In  all  SPAN  modules,  the  entity-stratification  specifications  text  associated 
with  a  control  word  may  not  exceed  6000  characters  in  length,  (in  the  STARS 
Files  Collator,  where  entity  stratification  may  be  specified  independently  on 
each  of  the  three  possible  source  files,  this  limit  applies  to  each  set  of 
specifications  separately.) 
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Other  limits  vary  by  module  as  shovn  in  Figure  11-9- 


1 _ UMmno,,s _ 1 

Module 

Maximum 
Stratum  Number 

Maximum 

Condition  Clauses 

Maximum 

Criterion  Value* 

Factor  Analysis 

215 

20 

100 

Regression  Analysis 

215 

20 

100 

Latent  Class  Analysis 

215 

20 

100 

Rectangular  Product  Moments 

215 

20 

100 

File  Transformation 

215 

20 

100 

STARS  Files  Abstractor 

215 

20 

100 

STARS  Files  Collator 

a« 

45* 

450* 

STARS  Files  Tabulator 

80 

80 

400 

STARS  Files  Summary-Sort 
_ 

215 

80 

400 

•Combined  total  for  all  source  files.  1 

Figure  II-9 


In  determining  the  number  of  criterion  values  used  in  a  specifications  text 
note  that  each  expressed  range  of  values  counts  as  a  single  value. 


STRATIFICATION  TEXT  ERROR  DETECTION  AND  DIAGNOSTICS 


Before  a  file  is  processed,  the  entity-stratification  specifications  text  is 
translated  into  an  internal  representation  designed  for  more  efficient 
execution.  The  translator  detects  certain  errors  in  the  specifications  text 
and  prints  appropriate  diagnostic  messages.  Errors  detected  are  those  relating 
to  (l)  excessive  size  of  the  stratification  text,  (2)  illegal  statement  syntax, 
and  (3)  violation  of  restrictions  on  the  number  of  elements  of  a  specification 
text.  An  error  detected  in  entity-stratification  specifications  renders  a  SPAN 
job  non-executable.  Control  in  that  case  passes  to  the  next  job  in  line. 

Diagnostic  messages  that  may  result  during  translation  of  entity-stratification 
specifications  are  listed  belov,  with  appropriate  conments  and  examples. 
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Excessive  Size  of  Stratification  Text 


***  ENTITY  SELECTION  SPECIFICATIONS  TEXT  IS  TOO  LONG  (OVER 
6000  CHARACTERS). 

***  ENTITY  STRATIFICATION  SPECIFICATIONS  TEXT  IS  TOO  LONG 
(OVER  6000  CHARACTERS). 

These  messages  result  from  violation  of  the  requirement  that  the  stratification 
text  associated  with  a  control  word  not  exceed  the  maximum  length  of  6000 
characters.  In  calculating  the  length  of  the  stratification  text,  note  that 
each  string- terminating  apostrophe  used  in  the  control  sentence  may  cause  1-5 
blank  characters  to  be  appended  to  the  string  to  maintain  the  internal  string 
length  at  an  integral  multiple  of  six  characters .  Blanks  so  generated  would 
contribute  to  the  character  count  of  the  string. 

Illegal  Statement  Syntax 

WHILE  EXPECTING  AN  -IF-  OR  -AND-  WORD,  PROGRAM  HAS 
ENCOUNTERED  A  WORD  IT  CANNOT  RECOGNIZE. 

A  condition  clause  is  preceded  by  an  illegal  operator,  as  shown  in  the 
following  example: 

***  STRATIFICATION  PROGRAM  DIAGNOSTIC 

WHILE  EXPECTING  AN  -IF-  OR  -AND-  WORD,  PROGRAM  HAS 
ENCOUNTERED  A  WORD  IT  CANNOT  RECOGNIZE. 

ERROR  SENSED  WHILE  SCANNING  THE  UNDERLINED  WORD  IN  THE 
FOLLOWING  PORTION  OF  THE  SOURCE  TEXT  — 

3, 78-92, AND  IF  X(42)= 

*1 

[,AND  IF]  is  not  a  legal  clause  connector.  Similar  error  message  would 
result  if  a  stratification  statement  began  with  [ASSIGN  56  X(gg)*1  . 

ILLEGAL  CHARACTER  IN  AN  OCTAL  CONSTANT. 

Octal  constant  contains  a  character  other  than  digits  0  through  7;  for 
example,  0  380. 

ILLEGAL  CHARACTER  IN  A  DECIMAL  CONSTANT. 

Decimal  number  or  integer  constant  contains  a  character  other  than  the  minus 
[-]  sign,  (e.g.,  3AB,  57+2)  or  digits  0  through  9- 
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DECIMAL  INTEGER  IN  A  DECIMAL  NUMBER  CRITERION  SET. 

A  decimal  number  criterion  set  contains  a  constant  without  a  decimal  point 
[.]  ,  as,  for  example,  in  the  following  text: 

3.5,6., .5-1., 1,  AND  X(7)  = 

Violation  of  Certain  Restrictions 


TOO  MANY  CONDITION  CLAUSES  IN  SOURCE  TEXT. 

Ihe  maximum  number  of  condition  clauses  permitted  in  the  specifications  text 
has  been  exceeded.  See  Figure  II-9  concerning  limits  in  various  modules. 

TOO  MANY  CRITERION  VALUE  SETS  IN  SOURCE  TEXT. 

Ihe  maximum  number  of  criterion  values  permitted  in  the  specifications  text 
has  been  exceeded.  See  Figure  II-9  concerning  limits  in  various  modules. 

Certain  other  violations  of  specification  rules  for  entity  stratification  are 
not  detected  directly.  For  instance,  a  decimal  point  [.]  appearing  in  a 
decimal  integer  constant  would  be  interpreted  as  a  period  t . ]  terminating  the 
statement.  If  the  last  statement  is  not  properly  terminated  by  a  period,  that 
statement  will  be  translated  incorrectly,  and  the  stratification  specifications 
will  not  execute  properly.  A  criterion  value  beginning  with  a  non-numeric 
character  other  than  0  will  be  automatically  interpreted  as  an  alphameric 
constant,  unless,  of  course,  it  is  contained  in  a  decimal  number  criterion  set., 


EXAMPLES  OF  ENTITY  STRATIFICATION 

Three  examples  of  entity  stratification  are  given  below.  The  first  example 
shows  the  straightforward  use  of  stratification  specifications  for  selecting 
entity  records  for  the  source  file.  The  second  and  third  examples  demonstrate 
the  Joint  use  of  data  transformations  and  entity  stratification. 

Example  I 

Assume  a  source  file  contains  l6  records.  Code  1(2)  is  the  record  sequence 
number.  Values  of  variable  X(210)  range  from  -50.7  to  +200.5* 

Records  4,  6  through  8  and  12  in  the  source  file  are  to  be  assigned  stratum 
number  11.  Records  sequence -numbered  1,  5,  13,  and  14  and  containing  values 
of  X(210)  between  0.  and  -50.7  are  to  be  assigned  stratum  number  12.  Records 
sequence-numbered  1,  5,  13,  and  14  and  containing  values  of  X(210)  between 
0.  and  200.5  are  to  be  assigned  stratum  number  13 - 
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The  following  specifications 

will 

perform  the 

SELECT  'ASSIGN 

11 

IF 

1(2) 

ASSIGN 

12 

IF 

1(2) 

X(210) 

ASSIGN 

13 

IF 

1(2) 

x(ao) 

desired  stratification: 

■  4,6*8,12. 

-  I,5,13>l4>  AND 

-  0.  —  50.7. 

-  1,5,13,1^  AND 

-  0.  -  200.5.  '• 


As  a  result,  records  sequence-numbered  2,  3»  9>  10,  11,  15>  and  16  will  not 
satisfy  any  of  the  stated  conditions  and  will  be  excluded  from  further 
processing.  The  stratus  number  values  are  placed  in  I (99)  and  nay  be  used 
in  further  processing,  e.g.,  as  a  key  for  sorting  or  sumnarizing  the 
derivative  file. 


Bcample  II 

This  example  shows  the  joint  use  of  data  transformations  and  entity 
stratification.  Records  are  to  be  selected  on  a  source  file  code,  1(2)  and  if 
a  computed  percentage  exceeds  50^. 

TRANSFORMATIONS  •  SUM  X(3)  THRU  X(l6)  -  X(l00l) 

X(13)  /  X(1001)  *  (100.)  -  X(1002) 

SELECT  'ASSIGN  1  IF  l(2)*l-5,  AND  X  (1002)  *  50.-100. 


Example  III 

The  following  example  again  illustrates  the  use  of  data  transformations  in 
conjunction  with  entity  stratification.  The  purpose  here  is  to  delete  from 
the  derivative  file  all  records  possessing  duplicate  values  of  a  key,  1(2). 

TRANSFORM  '  1(2)  BQ  1(60)  G0T0  $1$ 

(0)  -  I(6l)  G0T0  $2$ 

$1$  (l)  -  I(6l) 

$2$  1(2)  -  1(60)  '. 

SELECT  'ASSIGN  1  IF  l(6l)  -  0.  '. 
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APPEKDIX  A 

7090  REPRESENTATION  OF  ALPHAMERIC  CHARACTERS 


Char¬ 

acter 

Card 

BCD 

Tape 

Storage 

Cha'- 

acter 

J 

Card 

BCD 

Tape 

Storage 

Char¬ 

acter 

Card 

BCD 

Tape 

Storage 

Char¬ 

acter 

Card 

BCD 

Tape 

Storage 

■ 

12 

11 

H 

1 

1 

01 

01 

HI 

1 

61 

21 

J 

1 

41 

41 

/ 

1 

21 

61 

■ 

12 

B 

11 

0 

2 

2 

02 

02 

b 

2 

62 

22 

fl 

2 

42 

42 

S 

2 

22 

62 

12 

■ 

■ 

0 

3 

3 

03 

03 

c 

3 

63 

23 

B 

3 

43 

43 

fl 

3 

23 

63 

■ 

■ 

B 

12 

n 

B 

B 

1 

1 

04 

04 

4 

64 

24 

M 

fl 

1 

44 

u 

fl 

24 

64 

— 

wm 

12 

n 

m 

0 

5 

S 

05 

. 

D 

E 

5 

65 

25 

N 

s 

45 

45 

B 

5 

_ 

25 

65 

■ 

■ 

12 

« 

B 

B 

06 

06 

1 

6 

66 

26 

0 

6 

_ 

46 

46 

w 

B 

26 

66 

■ 

■ 

12 

■ 

B 

B 

B 

07 

07 

C 

7 

67 

27 

B 

47 

B 

B 

27 

67 

■ 

12 

11 

B 

B 

B 

8 

10 

10 

H 

8 

__ 

70 

30 

p 

_ 

8 

50 

50 

B 

B 

30 

70 

1 

12 

■ 

11 

B 

9 

11 

11 

I 

9 

71 

31 

B 

9 

51 

51 

z 

B 

31 

71 

blank 

blank 

20 

60 

■ 

12 

60 

20 

■ 

11 

40 

40 

■ 

a 

12 

00 

I 

I 

12 

n 

11 

II 

0 

1 

8-3 

13 

13 

B 

8-3 

73 

33 

B 

8-3 

53 

53 

B 

8-3 

33 

73 

11 

12 

■ 

m 

B 

0 

8-4 

14 

14 

B 

8-4 

74 

34 

m 

D 

54 

54 

B 

34 

74 

Ihis  table  has  been  reprinted  by  permission  from  IBM  7090/7094 
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FUNCTIONS  OF  VARIABLES 


Square  root 

SORT 

X(90) 

Logarithm  (natural) 

L0G 

X(90) 

Exponential 

EXP 

X(90) 

Sine 

SUJ 

X(90) 

Cosine 

C0S 

x(90) 

Arctangent 

ATAN 

x(90) 

Figure  II-2 


The  function  of  a  variable  is  a 
variable  would  not  yield  a  real 
log  0,  or  e^°,  the  value  of  the 


single  operand.  Where  the  function  of  a, 
number  of  allowable  magnitude,  as  in  (-4)2, 
function  is  set  to  -0. 


DECIMAL  NUMBER  CONSTANTS  (D) 


General  Form 


A  decimal  number  constant  consists  of  a  string  of  decimal 
'iipits  v±ik  u  at  the  bt^irxi.^  a-  *he  er.d, 
or  between  two  digits. 


The  decimal  number  constant  may  be  signed  or  unsigned.  A  decimal  number 
is  represented  internally  as  a  floating-point  number.  .The  magnitude  of  a 
decimal  number  constant  must  lie  between  lO"1^  and  lO1^,  or  be  zero. 
Examples:  (-.15);  (3*l4l6),  (12.)  or  (0.). 


DECIMAL  INTEGERS  (Cl  or  KL) 


General  Form 


A  decimal  integer  constant  consists  of  1-5  decimal  digits 
written  without  a  decimal  point. 

A  decimal  integer  code  is  represented  by  l(n),  where  n  is 
the  index  of  its  location  in  the  I-region. 
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A  decimal  integer  may  be  signed  or  unsigned.  Its  magnitude  must  be  less  than 
2^.  A  decimal  integer  constant  would  be  expressed  as  (1^92),  (-2),  or  (0). 

A  decimal  integer  code  is  represented  by  its  address,  for  example,  l(92). 
Decimal  integers  are  stored  internally  as  pure  binary  numbers,  occupying 
the  left-half  word,  with  the  right  l8  binary  bits  being  all  zeros.  If  a 
decimal  integer  is  negative,  the  minus  sign  occupies  the  high-order  position 
of  the  binary  word;  this  must  be  remembered  when  using  negative  integers  in 
Boolean  operations. 


OCTAL  INTEGERS (C2  or  K2) 


General  Form 


An  octal  integer  constant  consists  of  1-12  octal  digits 
preceded  by  the  letter  0. 

An  octal  integer  code  is  represented  by  l(n),  where  n 
is  the  index  of  its  location  in  the  I-region. 


The  maximum  value  of  an  octal  integer  is  2~$U.  Leading  zeros  need  not  be 
indicated.  Examples;  (0356),  (07777OOOO7777),  (03) •  An  octal  integer 
code  is  represented  by  its  address,  for  example,  l(92).  Octal  integers 
are  stored  internally  as  pure  binary  numbers,  each  octal  digit  comprising 
three  binary  bits. 


ALPHAMERIC  CONSTANTS  AND  CODES  (C3  or  K3) 


General  Form 


An  alphameric  constant  consists  of  1-6  alphabetic 
and/or  numeric  characters  preceded  by  the  letter  H. 

An  alphameric  code  is  represented  by  l(n),  where  n 
is  the  index  of  its  location  in  the  I-region. 


An  alphameric  constant  may  not  include  special  characters,  but  may  include  blank 
spaces  to  be  treated  as  characters.  Ihe  H  to  indicate  mode  must  be  used  even  if 
word  begins  with  an  H.  Examples;  (HABC),  (H2376),  (H23KV6),  (HHAPPY).  If  the 
string  exceeds  six  characters,  only  the  six  left-most  characters  are  used*  For 
example,  (HALPHAMERIC)  is  equivalent  to  (HAIPHAM),  i.e.,  the  first  six  characters* 
Blank  spaces  occurring  within  the  six  character  positions  following  an  H  are 
treated  (and  counted)  as  characters. 
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An  alphameric  code  is  represented  by  its  address,  for  example:  1(92)* 

Alphameric  words  are  stored  internally  as  binary  numbers,  with  two  octal  digits, 
and  therefore  six  binary  bits,  for  each  character.  They  are  left- justified, 
so  that  the  right  characters  consist  of  blanks.  The  IBM  7090  octal  (storage) 
representation  of  alphameric  characters  is  given  in  Appendix  A  ,  "7090 
Representation  of  Alphameric  Characters." 


COMPLEMENT  OF  OCTAL  INTEGER  CODE  (cl) 

The  l’s  complement  of  an  octal  integer  code  is  indicated  by  placing  a  [*] 
symbol  ahead  of  the  address  of  a  code,  for  example,  -1(15).  The  combination 
represents  a  single  operand. 


ARITHMETIC  TERMS 


Arithmetic  terms  may  be  simple  or  complex.  A  simple  arithmetic  term  consists 
of  one  operator  and  one  operand.  A  complex  arithmetic  term  represents  a 
single  operation  performed  on  more  than  one  operand.  Figure  II-3  lists  the 
operators  that  may  be  used  in  simple  arithmetic  terms;  Figure  11-^  lists  the 
two  types  of  complex  terms.  Only  certain  operands  may  be  used  with  a  given 
operator;  these  are  indicated  in  Figures  II-3  and  II-4  by  means  of  the 
operand  symbols  defined  in  Figure  II-l. 


[  SIMPLE  ARITHMETIC  TERM  OPERATORS  | 

Or "r at or 

Operation 

■ yplicaole  to  Operand  Types 

+ 

Add 

V,  F,  D 

- 

Subtract 

V,  F,  D 

* 

Multiply 

V,  F,  D 

/ 

Divide 

V,  F,  D 

T0 

Store  and  proceed 

V 

S 

Store  and  clear 

V 

Figure  II-3 

Division  by  zero  will  set  the  quotient  to  -0.  The  difference  between  the  two 
"store"  operators  will  be  discussed  in  the  section  on  expressions. 
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COiiPLEX  ARITHMETIC  TERMS  | 

Term 

Operation 

Admissible  Operand  Types 

SUM  x1  THRU  x2 

Sum  operands  x^  through 
x2 

V 

MED  x1  THRU  x>2  INT  x^ 

Calculate  median  over 
variables  x^  through  x2, 
with  x_  being  the  first 

element  of  a  set  of 
the  lower  limits  of 
corresponding  category 
intervals;  if  n  is  the 
number  of  categories,  the 
(n+l)th  element  contains 
the  upper  limit  of  the  nth 
interval. 

V 

Figure  II -4 

A  detailed  example  of  a  median  computation  will  be  given  later. 


BOOLEAN  TERMS 


Boolean  terms  consist  of  one  operator  and  one  operand.  They  are  listed  in 
Figure  11-5*  Only  certain  operands  may  be  used  with  a  given  operator;  these 
are  indicated  in  Figure  II-5  with  the  operand  symbols  defined  in  Figure  II-1. 


j  BOOLEAN  TERM  OPERATORS  | 

Symbol 

Operation 

Applicable  to  Operand  lVPes 

+ 

logical-or 

All  C  and  K2,  K3 

* 

logical -and 

All  C  and  K2,  K3 

/ 

exclusive -or 

All  C  and  K2,  K3 

LEFT 

shift  left  n  bits 

K1 

RIGHT 

shift  right  n  bits 

ra. 

T0 

store  and  proceed 

All  C 

S 

store  and  clear 

All  C 

Figure  II-5 
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•or  Boolean  operations,  including  shifts,  recall  that  all  operands  are  internally 
represented  as  pure  binary  quantities  of  36  binary  bits.  Shifts  are,  in  effect, 
accumulator  shifts,  not  "long"  or  "logical"  shifts;  therefore  bits  are  lost  in 
this  process.  On  a  left  shift  of  n  bits,  (n-l)  bits  are  lost,  while  n  bits 
are  :  cst  on  a  right  shift  of  n  bits.  Vacated  positions  are  filled  witli  'zeros. 

INTEGER  TERMS 


Integ-  terms  consist  of  an  operator  and  an  operand.  Hie  operators,  and 
applicable  operands,  urr  given  in  Figure  II -6. 


INTEGER  TERM  OPERATORS 

I  Applicable  to  Operand  Type  Cl 

I  Applicable  to  Operand  Type  K1  1 

1  (Dec 

imal  Integer  Code) 

(Decimi 

al  Integer  Constant) 

■a 

Operation 

Symbol 

Operation 

t-x 

Integer  Code  Add 

+ 

Integer  Constant  Add 

-X 

Integer  Code  Subtract 

- 

Integer  Constant  Subtract 

*x 

Integer  Code  Multiply 

* 

Integer  Constant  Multiply 

/X 

Integer  Code  Divide 

/ 

Integer  Constant  Divide 

=x 

Store  and  Clear 

Figure  II-6 


EXPRESSIONS 


An  expression  is  a  string  of  terms,  none  of  which  contains  the  operator  [ 
or  [=X].  Each  line  below  is  an  example  of  one  expression: 


Err.-r^^ion  Type 

X(52)  -  X(43)  /  (2.84)  Arithmetic 

1(2)  *  ( 0770000 )  LEFT  (l8)  *  (H0UKE)  Boolean 

(.55)  -  EQRTX(55)  -  X(12)  Arithmetic 

(0707)  /  1(87)  +  -1(3)  Boolean 

XI(52)  /X  1(33)  *X  1(1)  Integer 

MED  X(14)  THRU  X ( 21 )  INT  x(38)  T0  X(8o)  *  (2.)  Arithmetic 
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Blanks  are  ignored  in  expressions,  so  that  expressions  equivalent  to  the 
above  can  be  written  in  many  ways.  In  the  above  examples ,  each  of  the 
following  represents  a  single  term:  -X(43);  LEFT  (l8);  -  SQRTX(55);  -  -I (3); 
/X  1(33);  T0  X(80);  MED  X(l4)  THRU  X(21)  INT  X(38). 


RULES  FOR  CONSTRUCTING  EXPRESSIONS 


1.  All  terms  in  an  expression  must  be  of  the  same  type;  e.g.,  all  arithmetic 
terms,  Boolean  terms,  or  integer  terms.  Mixed  expressions  are  not  detected 
by  the  translator  and  will  yield  incorrect  results. 

2.  A  complex  arithmetic  term  may  appear  only  as  the  leading  term.  Thus  the 
following  is  a  correctly  formed  expression, 

SUMX(35)  THRU  X(4o)  *  X(2)/(l00.) 

while  the  following  expression  is  in  error, 

X{2)  *  SUMX(35)  InHU  X(40)/(100.) 

3.  For  expressions  made  up  of  either  simple  arithmetic  or  Boolean  terms  the 
operator  of  the  leading  term  is  always  an  implicit  [+ ]  .  'That  is,  no 
operator  except  the  (+1  may  be  used  for  the  leading  term,  and  the  (+  ] 

is  never  written.  (See  the  previously  given  examples.) 

4.  Expressions  are  evaluated  from  left  to  right;  the  operator  symbol  in  each 
term  defines  the  relation  between  the  result  of  preceding  terms  and  the 
value  referred  to  bj  .\c  rpc’-and.  In  the  expression  X(?2)  -  X(4^)  /  (2.84) 
the  quantity  Xi*o,  subtracted  from  X(52)  and  the  result  is  then  divided 
by  (2.84).  If  one  wanted  to  compute  X(43)  cuvided  by  (2.84)  anl  the 
quotient  to  be  then  subtracted  from  X( 52^  one  would  need  to  first  perform 
the  division  and  then  store  the  quotient  in  a  temporary  location,  as  will 
be  discussed  in  the  section  on  Statements.  In  some  cases,  other  methods 
can  be  used;  thus,  if  the  operator  ahead  of  X(43)  were  a  +  instead  of  a  -, 
one  could  write  X(43)  /  (2.84)  ♦  X(52).  In  general,  if  ©  is  a  typical 
operator  and  x  is  a  typical  operand,  then  the  expression 

Xie2X2®3x3^X4 


is  evaluated  as  if  it  read 

(((x^x^Gyc^t^x^) 

However,  parentheses  may  not  be  used  for  this  purpose  in  an  expression. 
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